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Abstract 

The broadcasting models on a d-ary tree T arise in many contexts such as discrete mathematics, 
biology, information theory, statistical physics and computer science. In this paper we consider 
the fc-colouring model where we assign a colour to the root of T and the remaining vertices are 
coloured as follows: For each child of a vertex v coloured <r v , assign it a colour chosen uniformly 
at random among all the colours but a v . A basic question here is whether the information of the 
assignment of the root affects the distribution of the colourings of the leaves. This is the so-called 
reconstruction/non-reconstruction problem. It is well known that dj In d is a threshold in the sense 
that 

• if k > (1 + e)d/ In d, then the information of the colouring of the root vanishes as the height 
of the tree grows and it does not affect the distribution of the colourings of the leaves. 

• if k < (1 — e)d/ In d, then the information of the colouring of the root affects the distribution 
of the colouring of the leaves regardless of the height of the tree. 

Despite this threshold, there is no apparent combinatorial reason why such a result should be true. 
Searching for an explanation, we note that the above implies that for k > (1 + e)d/ In d and two 
broadcasting processes that assign the root different colours the following holds: There is a cou- 
pling such that the probability of having vertices which take different colour assignments in the two 
processes reduces as we move away from the root. 

The description of such a coupling, especially a combinatorial one, is a stronger statement than 
the reconstruction/non-reconstruction threshold. As it would provide evidence on why we have 
this very impressive phenomenon. Devising such a coupling calls for understanding a complex 
combinatorial problem and it is a non-trivial task to achieve for any k < d. 

In this work we provide a coupling which has the aforementioned property for any k > 2d/\nd. 
Interestingly enough, the decisions that it makes are based on local rules, i.e. it considers only two 
successive levels of the tree. It should not be taken for granted that such a coupling is local for any 
k down to dj In d. 

Finally, we relate the existence of such a coupling to sampling fc-colourings of sparse random 
graphs, with expected degree d, for k < d. 

1 Introduction 

The broadcasting models and the closely related reconstruction problems on trees were originally stud- 
ied in statistical physics. Since then they have found applications in other areas including biology (in 
phylogenetic reconstruction (5] [16]), communication theory (in the study of noisy computation flU). 
Very impressively, these models arise in the study of random constraints satisfaction problems such as 
random fc-SAT, random graph colouring etc. That is, these models on trees seem to capture some of the 
most fundamental properties of the corresponding models on random (hyper)graphs (see fl3l ). 
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The most basic problem in the study of broadcasting models is determining the reconstruction/non- 
reconstruction threshold. I.e. whether the configuration of the root affects the distribution of the con- 
figuration of the leaves of the tree. The transition from non-reconstruction to reconstruction can be 
achieved by adjusting appropriately the parameters of the model. Typically, this transition exhibits 
threshold behaviour. So far, the main focus of the study was to determine the precise location of the 
reconstruction/non-reconstruction threshold in various models. 

In this work we focus on the colouring model, for which the reconstruction/non-reconstruction 
threshold is known lfT7l [191 l20l |4j. We investigate further the phenomenon searching for a natural, 
i.e. combinatorial, explanation of why the information decays in the non-reconstruction regime. An 
explanation which has been, somehow, elusive for a number of colours k smaller than the number of 
children per non-leaf vertex, d. As far as the reconstruction regime is regarded a satisfactory, combi- 
natorial, explanation is already known |[T71IT9l . Here we provide a combinatorial view of the problem 
by providing a coupling between two broadcasting processes which work for k well below d, i.e. for 
k > Id I In d. That is, it implies non-reconstruction for k > 2d/ In d. Even though the coupling is not 
optimal, it reveals a great deal of why we have non-reconstruction on trees. Interestingly enough, the 
decisions that it makes are based on local rules, i.e. it considers only two successive levels of the tree. 
The reader should not take for granted that such a coupling is fairly local for every k down to dj In d. 

Finally, recent advances in sampling colouring algorithms (see |j9l) allow to relate such a coupling 
to sampling /c-colourings of random graphs and regular graph for k smaller than the expected degree d. 
See further discussion in Section [PI 

1.1 The model 

The broadcasting models on a tree T are models in which information is sent from the root over the 
edges to the leaves. We assume that the edges represent noisy channels. For some finite set of spins 
S = {1,2,..., k}, a configuration on T is an element of S T , i.e. it is an assignment of a spins of £ 
to each vertex of T. The spin of the root r is chosen according to some initial distribution over E. The 
information propagates along the edges of the tree as follows: There is a k x k stochastic matrix M such 
that if the vertex v is assigned spin i, then its child u is assigned spin j with probability Mi j. 

Here we focus on the model which is known as the (proper) /c-colouring model (or fc-state Potts 
model at zero temperature). In particular, we assume that the underlying tree T is a complete d-ary tree, 
it is finite with height h and we let denote the leaves of T. Also, for the matrix M it holds that 



Broadcasting models and more specifically colouring can be considered as Gibbs measures on trees. 
That is, assuming that the broadcasting process over T starts with root r coloured i, then the /c-colouring 
after the processes has finished is a random ^-colouring of T conditional the colouring of r. We let m 
denote the uniform distribution over the configurations of T conditional that r is assigned i. 

Definition 1 We say that a model is reconstructible on a tree T if there exists S/or which 



where \ \-\\L h denotes the total variation distance of the projections of yn and fij on L^. When the above 
limit is zero for every i, j, then we say that the model has non-reconstruction. 

(Non)-Reconstructibility expresses how information decays along the tree. As a matter of fact, non- 
reconstruction is equivalent to the mutual information between the colouring of root r and that of Lh 
going to zero as h grows (see IPT51 ). Also, non-reconstruction is equivalent to Gibbs distribution being 




for i ^ j 
otherwise. 
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extremal (see ifTTTO . That is, typical colourings of the leaves do not provide information for the colouring 
of the root. The notion of extremality is weaker than the well-known Gibbs uniqueness where we require 
that every configuration of the leaves should not provide information for the colouring of the root. 

An early work about reconstruction/non-reconstruction problems on tree is that of Kesten and Stigum 
in lfT2l . The authors there present a general result for reconstruction, the so called "Kesten-Stigum 
bound" that states that reconstruction holds when X 2 d > 1, where A is the second largest eigenvalue of 
M in absolute value. This bound is sharp for a lot of models, e.g. Ising model (see (8]). In |[T5l it was 
shown that there are models where the Kesten-Stigum bound is not sharp, e.g. the binary models where 
M is sufficiently asymmetric or ferromagnetic (/-state Potts model with q large. As far as the (proper) 
fc-colouring model is regarded the Kesten-Stigum bound is not tight but we know the reconstruction/non 
reconstruction threshold quite precisely. From lfT7l IT91 l20l l4l we derive the following theorem: 

Theorem 1 For fixed e > and sufficiently large d, the following is true for the k-colouring model on 
a d-ary tree T: 

• Ifk>{l + e)d/ In d, then the model is non-reconstructible. 

• Ifk< (1 — e)d/\nd, then the model is reconstructible. 

Remark 1 We should remark that reconstruction bound is from lUTI Il9l and is based on analysing a 
simple reconstruction algorithm. As a matter of fact the reconstruction condition there is more precise 
than thar in TheoremQ] i.e. it should hold d > k[ln k + In In k + 1 + o(l)]. 

Remark 2 The non-reconstruction bound is from ||20] |4]. The result in ll20l provides a very precise 
condition for non-reconstruction, i.e. d < A; [In k + In In k + 1 — In 2 — o(l)]. In (4| the reader can find 
further interesting results about the problem. 

Unfortunately, neither of ll20l |4] give explicit evidence on why do we have the phenomenon of infor- 
mation decay on the trees. What is the natural (combinatorial) reason why such a result should be 
true. Searching for an explanation for the non-reconstruction we observe that Theorem Q] with Coupling 
Lemma (see Q) imply the following corollary. 

Corollary 1 For any fixed e > 0, sufficiently large d and k = (1 + e)d/ In d the following is true for the 
k-colourings of the d-ary tree T: Any two broadcasting processes which assign the root different colours 
can be coupled such that the probability of having vertices that take different colour assignments in the 
two processes reduces as we move away from the root. 

That is, somehow there is a rule which specifies how someone should correspond the choices of colour- 
ings in the first broadcasting process to the choices of the other process such that the probability of having 
a set of vertices that take different colours reduces as the distance of the set from the root increases. The 
reader should note that the construction of such a coupling is far from trivial for any k < d. 

Here we address the problem of constructing a coupling as specified in Corollary [T] based on local 
combinatorial rules. By local we mean that once the first process decides on colouring a fairly small set 
of vertices, then we should be able to know how the other process should colour the same set of vertices. 
Such a construction is an inteiplay among the combinatorial structure of the tree, its colourings and the 
Gibbs distribution of the colourings. In this work we provide the following result: 

Main Result: We construct a coupling for two broadcasting processes on a d-ary tree T that is combi- 
natorial, local and has the properties specified in Corollary [T]for k = (2 + e)d/ In d, for fixed e > and 
sufficiently large d. 
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1.2 Further Motivation - Non Reconstruction in Random Graphs & Sampling 

It is believed that the non-reconstruction/reconstmction phenomenon determines the dynamic phase 
transition in the random graph G n ^ m Ifl3l . Where G n>m denotes the random graph on n vertices and m 
edges while we let d be the expected degree, i.e. d = 2m/n. In this context we take d to be fixed. 

The dynamic phase transition is related to the geometry of /c-colourings of G n ^ m and it was pre- 
dicted by statistical physicists in lfT3l . based on ingenious however mathematically non-rigorous argu- 
ments. Let us be more specific. For typical instances of G(n, m), it is well known that the chromatic 
number \ ~ jo ( see ^ ne l" ste P Replica Symmetry breaking hypothesis (see |[T3l ) suggests 
that considering the space of /c-colourings of G n ^ m as k varies from large to small, we will observe the 
following phenomenon: For k = (1 + e)d/\nd (i.e. greater than 2\) all but a vanishing fraction of 
fc-colourings form a giant connected ball. The notion of connectedness suggests that starting from any 
colouring we can traverse the whole set of solutions by moving in steps. Each steps involves changing 
only a very small -constant- number of colour assignments. However, for k = (1 — e)d/\nd the set 
of fc-colouring shatters into exponentially many connected balls with each ball containing an exponen- 
tially small fraction of all fc-colourings. Any two colourings in different balls are separated with linear 
hamming distance. For a rigorous description of the shattering phenomenon see HI. 

We should remark that the dynamic phase transition in G n ^ m (roughly) coincides with the extremality 
/non-extremality transition of the colourings in a <i-ary tree. Further investigation into this coincidence 
yield the authors in lITOl to developed a sufficient condition for the tree and random graph reconstruction 
problem to coincide. In lfl4l this condition was verified for symmetric models like colouring. 

It is believed we can have approximatd3 random colouring of G n . m in the whole regime of non- 
reconstruction. Recently, the author of this paper in ||9] suggested a new algorithm for sampling colour- 
ings of G n ^ m with constant expected degree. Interestingly enough the accuracy of the algorithm depends 
directly on non-reconstruction conditions. The idea there is that we first remove edges of G nj7n until it 
becomes so simple that we can take a random colouring in polynomial time. Then, we rebuild the graph 
by adding the deleted edges one by one while at the same time we update the colouring of some vertices 
so as the colouring of the graph with the added edge to remain random. This algorithm requires at least 
(2 + e)d colours. However, since its accuracy depends on non-reconstruction conditions it is reasonable 
to expect that we can have an improvement by requiring even less colours.The algorithm does not exploit 
fully its dependency on non-reconstruction due to its colouring update rule. A new, improved, update 
rule is needed. Such an improvement could possibly reduce the minimum number of colours that the 
algorithm requires down to (1 + e)d/ In d. 

Very good candidates for improved updating rules are couplings as the one we present here. Of 
course so as to use it for sampling it requires solving non-trivial technical issues (which go beyond 
lITOl IT4Tl >. The close relation between random colourings of d-regular trees and the random colourings of 
G n ,m with expected degree d suggests that such an extension is a reasonable thing to ask. The situation 
is very similar if someone considers sampling colourings of random d-regular graphs. 

Remark 3 Of course, there are other approaches for sampling colouring that vary from combinatorial 
ones (e.g. Markov Chain Monte Carlo (UdH) to numerical ones (e.g. Belief propagation [7 ]). However, 
it seems that the setting in [|9] is substantially easier to analyse for k being a small function of d. 

1.3 A basic description of the coupling. 

Assume that we have the <i-ary tree T and two broadcasting processes. The first one fc-colours T as X 
and the other as Y. Assume that the root r of T is coloured as X(r) = c and Y(r) = q while c / q. 

Consider first the following recursive naive coupling of the two processes. Start from the root r 
down to the leaves. For each vertex u G T such that X(u) = Y(u) couple every w, descendant of u, 

1 With a reasonable accuracy 
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by using identical coupling, i.e. X(w) = Y(w). On the other hand, if X(u) ^ Y(u), then couple the 
colourings of the children of u according to an optimal coupling for each child. 

The above coupling has the property that for any vertex w such that X(w) / Y(w), under both 
X and Y, there is a path which includes r and w and its vertices are coloured using only X(r) and 
Y(r). That is the paths of disagreement in both colourings propagates as bicolored paths that use the 
colours c, q. For a number of colours k < d, we expect that the above coupling generates an ever 
increasing number of disagreeing vertices as it moves from the root down to the leaves. As a matter of 
fact the number of disagreeing vertices at each level grows as a supercritical branching process, i.e. the 
probability of having disagreement at the leaves is strictly positive, regardless of their distance from the 
root. 

Our coupling improves on the naive one. Consider the root r and let Ni be the set of vertices which 
contain the i-th child of r and the children of i (i.e. two levels below the root). We call "bad" the 
colouring X(Ni) if X(i) = q and i has a child j such that X(j) = c. Similarly, Y(Ni) is bad if 
Y(i) = c and i has a child j such that Y(j) = q. It turns out that the challenge is to deal with these 
bad colourings. The problem reveals by just considering the naive coupling. There, if X{Ni) is bad, 
then Y(Ni) should be bad too. For such a pair the identity coupling is precluded and the creation of 
disagreements is inevitable. For k < d, the naive coupling fails due to the fact that it generates too many 
bad pairs X(Ni),Y(Ni). 

It is direct to see that for k < d it is unavoidable that there are a lot of bad colouring among X(Ni)s 
and y(JVj)s. The basic idea, here, is to avoid coupling them together. To this end we use the following 
fact which is true as long as k > (2 + e)d/\nd: Consider X(Nj) conditional that A) it is a bad and 
B) there is at least one colour that is not used by X(Nj). For such X(Nj) it is highly likely that there 
is another child of r, e.g. vertex s, such that the joint distribution of the colouring of the children of s 
under Y(N S ) is identical to the joint distribution of the children of j under the colouring X(Nj). For 
convenience, we call Y(N S ) j-good, since X(Nj) is badH. 

Based on the above observation, the target now is to couple the colourings X(Ni)s and Y(Ni)s such 
that if X(Ni) satisfies the conditions A) and B), stated above, then Y(Ni) is a z-good colouring and vice 
versar]. Then we can have identical coupling for the colourings of the children of the vertex i. On the 
other hand, for X(Ni) (or Y(iVj)) a bad colouring that either uses all of the k colours or there is not 
any z-good colouring, it is inevitable that disagreements are generated. For all the rest colourings, i.e. 
neither bad nor good we have identical coupling. In the next section we provide further details about our 
coupling on a more technical basis. 

Working as described in the previous paragraph, the number of disagreements drops dramatically, 
compared to naive coupling. As a matter of fact the number of disagreeing vertices grows as a subcritical 
branching process, i.e. the probability of having disagreement at the leaves drops exponentially with the 
distance of the leaves from the root. 

Remark 4 One could hope that employing directly the ideas of this improved coupling to larger neigh- 
bourhoods could reduce the bound for k down to (1 + e)d/ In d. However, technical reasons suggest that 
a non-trivial improvement is required for getting k below (2 + e)d/\nd, rather than only considering 
larger neighbourhoods. 

Remark 5 The update rule in the sampling algorithm in is, somehow, an extension to G n>m of what 
we call here as naive coupling. In Section [Ol we suggest extending this new coupling as an update rule 
for the algorithm in ||9]. This would yield to a lower bound for k below the expected degree d. 

2 Of course, for the bad colourings among Y(Ni)& we can find good colourings among X(Ni)s, as well. 
3 If Y(Nj) satisfies the conditions A) and B), then X(Nj) is a ji-good colouring. 
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Notation. We use small letters of the greek alphabet for the colourings of G, e.g. a, r. The capital 
letters denote random variables which take values over the colourings e.g. X, Y. We let a v denote the 
colour assignment of the vertex v under the colouring a. Similarly, the random variable X(v) is equal 
to the colour assignment that X specifies for the vertex v. For an integer k > we let [k] = {1, . . . , k}. 

2 Coupling 

In this section we present the coupling in full detail. We consider a complete d-ary tree T. Let //(■) 
denote the uniform distribution over the A;-colourings of T, for some k we define later. We consider two 
broadcasting process such that the first one assigns the root r colour c while the second one assigns the 
root q. To avoid trivialities assume that c / q. Finally, we let X, Y be the colourings that the first and 
the second processes assign to T, respectively. It holds that X is distributed as in fi(-\X(r) = c) while 
Y is distributed as in fj,(-\Y(r) = q). We proceed by introducing some useful concepts. 

2.1 Preliminaries 

Lists of Colour Assignments. We let Lx G [k] d be an ordered list which contains the colours that are 
assigned to the children of the root r under the colour assignment X, but without revealing which colour 
gets to which child. Since X is distributed according to fj,(-\X(r) = c) each entry Lx{i) is a random 
choice from [&]\{c}. Additionally, for every % G [d] we let L x be the corresponding lists of the colour 
assignments of the children of the vertex that is going to be assigned the colour Lx(i). We, also, have 
the corresponding lists w.r.t the colouring Y (see e.g. Figured). 

Remark 6 Given Lx, the colouring of the children of r under the colouring X corresponds to a random 
permutation of the elements of Lx- That is, if 7r is a random permutation of the elements (1, . . . , d), 
then for the i-th child of r we can set X(i) = Lx{i^{i))- We work similarly for the grandchildren of r 
with the lists L l x . 

Each of L x ,Ly will be classified to at least one (possibly more) of the following 4 categories of lists. 

bad: We call "bad" every list L x (L l Y ) which has the property that c G L % x (q G L l Y ) while Lx(i) = q 
(Ly(i) = c). E.g. see Figure |2] 

rescuable: A list L x (or L Y ) is rescuable iff it is "bad" and it contains less than k — 1 different colours, 
j-special: Given a rescuable list L J Y , a list L % x is called "j-special" if the following holds: 

1. L x (i) / qwAL x {i) $ L Y . 

2. One of the following two happens: 

• L x contains q but it does not contain c. 

• L x contains c but it does not contain q. 

W.r.t. to a rescuable list L x , we define the j-special lists L l Y analogously (e.g. see Figure |3). 
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M») Ly(j) Ly(t) 




Figure 3: The lists L Y and L Y are both i-special (L x is rescuable). Additionally, the list L Y is i-good. 

j-good: A list L x is called "j-good" if it is j-special and q G L l x while c ^ U x . Similarly the j-special 
L Y is j -good if c G L l Y and g ^ Ly (e.g. see Figure [3). 

Lemma 1 Assume that for some i,j G [d] /? /jo/Js ?/zctf Lj^ is rescuable and U Y is j-good. Then L x 
and L l Y are identically distributed. 

For the proof of Lemma Q] see in Section 157X1 
2.2 The coupling 

For both colourings X, Y consider the lists of colour assignments Lx, Ly as well as the lists U x and 
U Y , for i = 1, . . . , d. First, we are going to couple the lists. Then, given the colours in the lists we are 
going to define the actual colouring of the vertices (see Remark©. 

Roughly speaking, the coupling works as follows. In a first phase it reveals only a certain part of 
information about the lists Lx, Ly, L x and L Y . That is, it reveals the "bad" lists, the "rescuable" and 
the "special" lists. In a second phase, it uses this information about the lists to construct a bijection 
/ : [d] — > [d] among L x s and L Y s. The use of this bijection is the following: If f(i) = j, then when 
we reveal the full information about the lists we couple optimally Lx{i) with Ly(j) and L x with L Y , 
conditional, of course, the information we have already revealed. The bijection / is constructed so as 
to minimize the number of disagreements at the grandchildren of r. In a third phase, we reveal the full 
information about the lists by using /, as described above. Finally, given the full information for the 
lists we reveal the assignments of X, Y for the (grand)children of r. Note that if f(i) = j then the child 
of r that gets Lx{i) under X will get Ly(j) under Y. Additionally, the grand child of r that is assigned 
the colour L l x (t) under X is going to take the colour L Y {t) under Y. 

The pseudocode for the Phase 1 follows: 

Phase 1: /*Partial revelation of the lists*/ 

1. Reveal only for which i we have Lx(i), Ly(i) G {c, q}. Couple the choices of Lx and Ly such 
that if Lx(i) = q, then Ly(i) = c. 

2. For each i such that Lx{i) = q and Ly(i) = c reveal whether L l x and L l Y is "bad" or not. We 
use coupling such that the result of revelation to be the same for both L l x and L Y . 

3. For each pair of bad lists (L l x ,L l y ) reveal whether they are "rescuable". We use coupling such 
that the result of revelation to be the same for both L l x and L' Y . The coupling is so that the set of 

colours in [k]\{c, q} that are not used to be the same for both rescuable lists L X ,L Y . 

4. Partition the non-bad L x s and L y s to (maximal) equally sized sets. The partitions are as many 
as the rescuable pairs and each rescuable pair is associated to exactly one partition. That is, the 
rescuable pair (L x ,L l Y ) is associated to a set of indices A{ C [d] such that the following holds: 
For any t G Ai L x and L Y belong to partition associated to (L l x , L Y ). 
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5. For each i and for each j G Ai we reveal if the pair (L J X ,L Y ) consists of i-special lists. We use 
coupling such that either both lists in the pair are i-special or both are not. 

We should recognize the bad lists as sources of potential disagreements in the coupling. However, our 
attempt is to eliminate the disagreements caused only by the rescuable ones (a subset of bad one£|). 
For this reason each rescuable pair of lists (L x , L l Y ) is associated with the i-special lists in Ai. Before 
proceeding, let us make the following clarification. 

Remark 7 For the i-special pair (L x , L Y ), above, we do not necessarily have Lx{j) = Ly{j). 

In the next phase we construct the bijection /. Basically, what we need to do is for each rescuable pair 
(L l x ,L Y ) to find an i-good pair (L X ,L J Y ) among the i-special pairs in A^ Once we find one, we set 
f(i) = j and /(j) = i. That is, when we reveal the full information of the lists we couple L l x with L Y 
and L x with U Y . Then, from Lemma[T]we can have identity coupling for both pairs. Thus, we eliminate 
the disagreement we would have had if we had coupled U x and U Y , together. 

However, so as to have an i-special pair (L J X , L Y ), for j G Ai, with both lists i-good we should 
couple the lists such that L J X ^ L Y . To be more specific, we have to reveal whether c G L x and 
q ^ L x or the opposite, i.e. c ^ I? x and q G L J X . Once we have this information for L 3 X , e.g. assume 
that we have c G L x and q ^ L x , then the coupling should decide the opposite for L Y , i.e. c ^ L Y and 
q G L Y . It is straightforward that when we reveal an i-special pair (L X ,L Y ) in such a way it is z-good 
with probability 1/2. With the remaining probability we end up with a pair (L X ,L Y ) which is not 
i-good, i.e. useless, but even worse we cannot couple L x with L? Y identically. Call such a disagreeing 
pair as i-fail (see example in Figure HJ the upper pair is i-fail). It is straightforward, now, that as we 
search for an i-good pair it is possible that we generate extra sources of (potential) disagreements. To 
this end we use the following lemma. 

Lemma 2 Assume that the i-special pairs (L x , L Y ) and (L x , L Y ) in Ai are revealed and (L x , L Y ) 
is i-good while (L S X ,L Y ) is i-fail. Then, L x is identically distributed to L Y and L Y is identically 
distributed to L s x . 

For a proof of Lemma|2]see in Section [5T2l Figure @]gives a schematic representation of what is stated 
in Lemma [2] The arrows (Figure [4) show the pairs of lists that are identically distributed. 

To deal with the potential disagreements generated by z-fails we reveal extra i-good pairs. In partic- 
ular, we work as follows: 

Phase 2. /*List Association*/ 

For each rescuable pair (L l x ,L Y ) do the following: 

1. Reveal each i-special pairs in Ai whether it is i-good or i-fail until either of the following two 
happens: 

• the number of i-good pairs exceeds the number of i-fails by one, 

• there are no other i-special pairs in Ai to reveal. 

2. Reveal the remaining unrevealed i-special pairs, if any, by using identity coupling. 

3. Match every i-good pair with one i-fail such that no two i-good pairs are matched to the same 
i-fail pair. 

4. If the i-good pair (L J X ,L Y ) is matched with the i-fail (L S X ,L Y ), then set f(j) = s and f(s) = j. 
4 For the values of k we are interested in, it is highly unlikely that a bad list is non-rescuable. 
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Figure 4: Matching between i-fail and i-good pairs. The coupling between Lx(s)-Ly(t) and Lx{t)- 
Ly(s) is done after the good/fail revelations. 

5. If there is an i-good pair (L J X ,L Y ) for which there is no z-fail to be matched, match it with the 
rescuable pair (L X ,L Y ). Also, set f(i) = j and f(j) = i. 

6. For each j € Ai such that (L x , L Y ) is not matched yet, match it to itself and set f(j) = j. 



Ideally, Phase 2 generates a number of i-good pairs which exceed the number of i-fails by one. If this is 
the case, then from Lemma [2] we can construct the function / such that no disagreement is generated by 
any z-fail pair in the full revelation of the lists. Furthermore, there is an extra i-good pair to be matched 
with the rescuable pair (L l x , L l Y ) and no disagreement is generated by the rescuable pair as well (by 
Lemma [T]). 

Of course, it is possible that the number of the i-good pairs is not sufficiently large. Then, we end 
up with the rescuable pair (L x , L l Y ) with some i-fails which cannot be matched with any i-good pair. 
These pairs are matched to themselves and some disagreements are going to appear in the full revelation. 
However, we show that the expected number of disagreements vanishes as long as k > (2 + e)d/ In d. 
We now, proceed with the Phase 3. 

Phase 3: /*Full revelation*/ 



1. For every s, t such that f(s) = t, couple optimally Lx(s) with Ly(t) as well as L s x with L Y . 

2. Reveal which element of the list Lx is assigned to which child of r and which element of L x 
goes to which grandchild of r, as Remark [6] specifies. 

3. Assuming that v, child of r, is such that X(v) is set Lx(s), then we set Y(v) equal to Ly(t), 
where t = f(s). Also, for u, child of v , such that X(u) set L s x (j) we set Y(u) equal to L Y (j). 



Applying the same coupling for the grandchildren of the root, at the end we get the full colourings X and 
Y . For completeness, in Section|6]we provide the full pseudocode of coupling as a recursive procedure. 
A very basic result is the following theorem. 

Theorem 2 For c,q G [k], assume that in the above coupling it holds X(r) = c and Y{r) = q, where 
r is the root vertex ofT. Then at the end of the coupling, X and Y are distributed as in fi(-\X(r) = c) 
and n(-\Y(r) = q), respectively. 
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Proof: Theorem follows by noting that for every list, conditional on the information that is already 
known to us, we reveal some information by using the appropriate distribution. □ 



Furthermore, from the description of the coupling the following corollary is direct. 

Corollary 2 The disagreements in the coupling have three different sources: 

1. Pairs of bad lists which are not rescuable. 

2. Pairs of rescuable lists for which it was impossible to find a good pair. 

3. Pairs ofi-fail lists, for some i, which are not matched to an i-good pair. 

Proposition 1 Consider the above coupling between X and Y and let Wi be the number of vertices u 
at level I such that X(u) ^ Y(u). For fixed e > 0, sufficiently large d, k = (1 + e) j^? and every even 
integer I > it holds that 

Using Proposition [T]it is direct to see that our combinatorial construction implies the following theorem. 

Theorem 3 For fixed e > and sufficiently large d, the following is true for the k-colouring model on 
a d-ary tree T:Ifk = (2 + e)d/ hid, then the model is non-reconstructible. 

Proof: Take k = (2 + e)d/lnd. Let X and Y be distributed as in (i(-\X(r) = c) and p,(-\Y(r) = q), 
respectively, while their joint distribution is specified by the coupling we presented. Let be the set of 
vertices that contains all the vertices of T at distance h. We take h even. By Coupling Lemma we have 

M-\X(r) = c) - fi(-\Y(r) = q)\\ Lh < Pr[X(L h ) + Y{L h )}. (1) 

Let Wh be the number of vertices m£L/, such that X{u) ^Y{u). It holds that 

Pr[X{L h ) / Y(L h )] = Pr[W h > 0] < E\W h ] [by Markov's inequality] 

< Id ul E +2j [from Proposition [D. (2) 

The theorem follows by combining £[]) and ©. □ 



3 Proof of Proposition [j] 

Consider in the coupling two vertices v, w G T at the same level where I is even. Given the colourings 
X(v), Y(v) and X(w) and Y(w), and assuming that X(v) ^ Y(v) and X(w) ^ Y(w) observe that 
whether the descendants of v disagree or not does not dependent on what happens at the descendants of 
w and vice versa. This observation yields to the following: For some vertex v € T that X(v) / Y(y), 
let V v be the number of disagreements two levels below the vertex v. Then, it holds that 

E[Wi\ Wi- 2 ] = E[V V ) ■ W1-2 for even I > 0. 

Taking the average from both sides and working out the recursion it is easy to get that 

E[Wi] = (E[D v ]) 1 / 2 . 
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The proposition will follow by bounding appropriately E[D V ]. So as to bound E[D V ] we need to bound 
the number of disagreements that are generated by each of the three sources of disagreement specified 
in Corollary |2] It, always, holds that D v < d 2 , since T is a <i-ary tree. 

Consider the following quantities related to the vertex v: Let (3 V denote the number of bad lists. Let 
5k be the probability for a bad list to be rescuable, for a given number of colours k. Finally, given some 
rescuable list L 3 X let hi be the number of j-special lists in the partition it is assigned. Let the event A 
denote that at least one of the following three is true 

1. p v > 100 In d. 

2. There is at least one bad list which is not rescuable. 

j 4 e-1 

3. There is a rescuable list L x that is corresponded to a partition with less than d 5 *+« j-special lists. 
It is direct to get that 

E[D V ] < d 2 Pr[A] + E[D V \A% (3) 

where we use the rather crude overestimate that conditional on the event A occurs all the d 2 descendants 
of v are disagreeing. It suffices to bound appropriately Pr[A] and E[D V \A C ]. To this end, we use the 
following propositions. 

Proposition 2 For k = (1 + e)d/ In d and for sufficiently large d, we have that 

E[D V \A C ) < (T - 102 St. 
Proposition 3 For k = (1 + e)dj In d and for sufficiently large d, we have that 

Pr[A\ < 5d~ 250 . 

Form Proposition |2j Proposition |3]and for sufficiently large d, we have that © implies that 

E[D V ] < ^ ai Sr. 

The proposition follows. 

3.1 Proof of Proposition |2] 

Since we have conditioned on A c , we have that A) (3 V , the number of bad lists, is at most 100 In d, B) all 

4 e-1 

the bad lists are rescuable and C) every rescuable list L' l x is assigned to Ai which contains at least d 5 x + e 
^-special lists. Clearly, from (A) and (B) we deduce that the number of rescuable lists is equal to j3 v . 

Let the set Aj contain the pair (L X ,L Y ), for s G ij U {i}, if L s x should be coupled with L Y in 
Phase 3 while identical coupling of the lists is impossible. We remind the reader that in the second phase 
of the coupling we reveal which of the i-special pairs in Ai are i-good or not, i.e. during the steps 1 
and 2. During these revelations it is possible that we introduce pairs which are z-fails which are coupled 
together (due to lack of i-good pair). These z-fails, for which it was imposible to find i-good pair belong 
to Aj. Of course, the rescuable pair (L X ,L Y ) can also be in Aj, so long as L x should be coupled with 
Ly in Phase 3. 

Conditional on the event A c the following holds: Consider (L x , Ly) £ Aj which is non-rescuable. 
It is easy to see that we can couple Lx(t), Ly(t) such that Lx(t) = Ly(t). Also, given that Lx{t) = 
Ly{t) it holds that c G L x and q ^ L x , also q G L Y and c ^ Ly. Furthermore, all the rest colours, i.e. 
all but c, q, Lx{t), are symmetric for both lists L x and L y . Clearly, we can couple L x and L Y , such 
that if L x (s) = c then L Y (s) = q while if L x (s) / c, then L x (s) = L Y (s) for any s G [d]. Also, we 
can have exactly the same coupling for the rescuable pair (L l x ,L Y ) G Aj. 
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Let Zt be the number of disagreements that are generated by the coupling of the pair (L x , L Y ) G Aj. 
Also, let Qi be the number of all disagreements that we get from coupling the pairs in Aj after the third 
phase. It holds that 

E[Qi\A c ] = E[\Ai\\A c ] ■ E[Zj\A c ]. 
Also, by the linearity of expectation it holds that 

E[D V \A C ] < (100lnd)E[Qi\A c ] [as A c assumes that j3 v < 100 In d] 

< (WOlnd) ■ E[\Ai\\A c }- E[Zj\A c }. (4) 

The proposition will follow by bounding appropriately i?[|Aj||A c ] and i?[Zj|A c ]. 

As far as i£[Zj|A c ] is concerned notice the following: Conditional on A c , every list L x G Aj has 
a number of entries with colour c that is binomial distributed B(d,l/k), conditional that there exists at 
least one entry which is c. For L Y E Aj we have the same conditions w.r.t. the colour q. Coupling 
the lists (L X ,L Y ) G Aj as specified above, the number of disagreements is exactly the number of 
occurences of c in L x (or the occurences of q in L Y ). Then, it follows easily that 

d 

E[Zj\A c ] = s ■ Pr[c appears s times in L x \c appears at least once in L x ] 



s=0 



l * M / l \ s / 1 x d ~ s 



fe-1 

< 2^ [since l-(l-^)"> 1/2] 

< 2 In d. [since k = (l + e)d/ In d] (5) 

As far as £'[|Aj||A c ] is concerned, we work as follows: Let Si be the set that contains all the i-special 
pairs in Ai as well as the rescuable pair (L x , L Y ). W.l.o.g. assume that i = 1 while the indices of the i- 
special pairs in Si are from 2 to |5j|. Let the 0-1 matrix S = |<Sj| x 2 be defined as follows: 5(1, t) = 1, if 
c G L x and q £ L x , otherwise, i.e. c ^ L x and q £ L x , 5(1, t) = 0. Similarly, 5(2, t) = 1 if c ^ L Y 
and gr G Ly , otherwise 5(2, t) = 0. It is clear that for the i-special pair (L x , L Y ) that is i-good, it holds 
that (5(1, t), 5(2, t)) = (0,1) on the other hand if (L X ,L Y ) is i-fail, then (5(1, t), 5(2, t)) = (1,0). 

Remark 8 The second phase of the coupling specifies how 5(1, j) and 5(2, j) are con - elated with each 
other. In particular, the following holds: if X)}=i ( 1 , J ) — 5(2, j)) > 0, then 5(1, i) and 5(2, i) are 

coupled so as to get complementary values. Otherwise, i.e. if Y^j=i 0->(1j j) ~ <5(2j j)) = 0, they are 
coupled identically. 

Since we have assumed that the values in (5(1, 1), (2, 1)) are related to (L x ,L l Y ), by defini- 
tion it holds that (5(1, 1), (2, 1)) = (1,0). Furthermore, for each t = 2...|5j| and as long as 

(5(1, j)- 5(2, j)) > Owe have 

(qn A o/ 9 _ / with probability 1/2 

' " \ (0,1) with probability 1/2. 

For the matrix 5 we have the following lemma. 

Lemma 3 The cardinality ofA{ and S are related as follows: 

N 

| A*] =^5(M)-5(2,£), 
t=i 

where N is the number of columns of the matrix S. 
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Proof: First notice that 5(1, 1) — 5(2, 1) = 1. The coupling during the second phase assigns comple- 
mentary values to each pair 5(1, t), 5(2, t) as long as R t = Y%Zi [^(l, t) - 5(2, t)] > 0. Once R t = 
it sets 5(1, t) = 5(2, t), i.e. Rt remains zero for the rest values of t. 

Let T be the maximum t such that 5(1, t) / 5(2, t). Clearly, it suffice to show that 

T 

| Ai| =^5(l,i)-5(2,t). 
t=i 

For t < T, the fact that 5(1, t) = 1 (and consequently 5(2, i) = 0) suggests that we have revealed an 
i-fail. On the other hand, if 5(1, t) = (and consequently 5(2, t) = 1), then it suggests that it has been 
revealed an f-good pair. This observation implies that the sum Ylt=i "^0-! * s ec l ua l to the number of 
z-fails we have revealed, while Ylt=i <^(2) i s equal to the number of f-good pairs. 

Since we can match an i-fail with an i-good pair to avoid generating disagreements, the number of 
pairs which do not admit identical coupling, i.e. the i-fail and possibly the rescuable pair, is equal to 

T N 

J>(l,i) -S(2,t) = £S(l,t) - 5(2, t). 
t=i t=i 

The lemma follows. □ 



Proposition 4 Let N be the number of columns of S. Then for sufficiently large N it holds that 



E 



N 



£(S(l,i)-S(2,i)) 

3=1 



< 



2.3 

7T 



0.43 In N 



(6) 



For a proof of Proposition @]see in Section|4] Using Proposition @] and Lemma[3]and the assumption that 
the number of i-special pairs in A4 is at least d 5 , we get 



,4(6-1) 



' 2.3 \ - 43 W+tj 



lad 



< d -0.344f^iln(^) < d -0.107|^_ 
Plugging into (0]) the inequalities (f5]> and |7]) we get that 



E[D V \A C ] < 



200 In 2 d 

,0.107^- ' 



The proposition follows by taking sufficiently large d. 
3.2 Proof of Proposition |3] 

For the quantities, (3 v ,5k and h v we defined in Section [3] we have the following proposition. 
Proposition 5 For k = (1 + e)d/ In d the following are true: 

d 



Pr 



Pv>(l + x) 



k-1 



:ic>(.r) 



where <p(x) = (1 + x) ln(l + x) — x,for real x > 0. A/50, ;Y holds that 



(V) 



(8) 



(9) 



13 



Finally, for any c > it holds that 



Pr 



e-l 
d J + e 

K < n ; j \Pv < dnd 
2cm a 



< 



exp 



e-l \ 

Sclnd 



(10) 



Let the events E\ ='j3 v > 100 In d", E 2 =' there is at least one bad list which is not rescuable" 

j 4 e-l 

and £3 ="there is a rescuable list that is corresponded to a partition with less than d 5 !+ e j-special 
lists". It is direct that 



Pr[A] = Pr 



U 

,i=l 



Ei 



< 



Y^Prm. 



(ii) 



i=i 



The proposition will follow by bounding appropriately the probability terms Pr[E\],Pr[E2] and Pr [£3] 
in the above relation. As far as Pr[E{\ is regarded note that for 1 + xq = 98(1 + e) it holds that 



Pr[Ex] < Pr 

The above inequality holds since 

H d 2d 

fc-i — k k 2 

Ind 21n 2 d 
< + 



/3 V > (1 + Xq) 



d 



jfc-1 



(12) 



+ 



[as ^ < i + ^] 
[as fc > (1 + e)d/lnd]. 



We use Proposition 13 (i.e. ®) to bound the r.h.s of (fL2l . Noting that for xo = 98(1 + e) - 1 it holds 
that 4>(x ) > 343(1 + e) + 98(1 + e) ln(l + e), it is direct to see that 

Pr[Ex\ < d" 250 . (13) 

As far as Pr[E2\ is regarded, we let NR V be the number of non-rescuable lists. Clearly, it holds that 

Pr[E 2 ] = Pr[NR v > 0] < E[NR V ], (14) 

where the last inequality follows from Markov's inequality. Using (©, it is direct that 

E[NR V ] < (1-S k )d 
< exp 



(15) 



- 3 ^ 1 + £ ^ i^ ) d < exp ( -d 2 ^) 



81nd 

Plugging the above inequality to (fT4l we get that 

Pr[E 2 ] < exp (^-d^+^^j . 

Finally as far as Pr[Es] is regarded, note that 

Pr[E 3 ] < Pr[E 3 \f3 v < 100 lnd] + Pr[/3 V > 100 lnd]. 



(16) 



4 £-1 

We let NC V be the number of bad lists which are corresponded to a partition with less than d 5 x + e special 
lists. Clearly, it holds that 

Pr[E 3 \(3 v < 100 lnd] = Pr[NC v > 0|A, < 100 lnd]. 
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For the rescuable list LP we let h v denote the number of j-special lists that is assigned. It holds that 



Pr[h v < c?s t+7 |/3 W < 100 In d] < Pr 



h„ < 



d~ 



41n 5 d 



< 



exp 



e-1 > 

800 In d 



\P V < 100 In d 

[from CGD] 



4 e-1 

< exp I — d 5 x + e 



It is direct that 

E[NC V \(3 V < 100 In d] < (100 In d)Pr[h v < dm\/3 v < 100 In d] 

( 3 e-1 

< exp I — d 5 !+ £ 

Using Markov's inequality we get that 

Pr[NC v > 0\f3 v < 100 In d] < E[NC v \p v < 100 In d] < exp (-<$ 1 ~ 
Plugging the above inequality and (fT3l) to (fT6l ) we get that 

Pr[E 3 ] < exp (-dtir!) +d~ 250 < 2d~ 250 . (17) 
Plugging in (O, ([15]) and £T7]> to ([Tl]) we get that Pr[A] < 5d~ 250 . The proposition follows. 
3.3 Proof of Proposition |5] 

The inequality in (|9]) follows from the following two lemmas. 

Lemma 4 Consider a random k-colouring ofT. Let v E T, and let L be the list of colours that appear 
in the children of v. For k = (1 + e)d/ In d and for any colour c that is not assigned to v it holds that 

i_ i_ 4d 

\Pr\c L\- d !+ E | < d !+ e ■ -p. 



Proof: Clearly, it holds that 

Pr[c <£ L] = 



1 



k- 1 

d 



< 



exp 



k-1 



[as 1 — x < e x ] 



exp [ --} < d !+ e . 



Also we have that 
Pr[c <£L] = ^1 - 
> exp 



1 



k- 1 
d Ad 



> exp 



d 



k-2 



[as 1 - x > e~ x l^~ x ) for < x < 0.1] 



,__L_ / Ad 
k k*)^ d ^ V-Y^ 



□ 
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Lemma 5 For a vertex v let f v denote the number of colours that are not used by its children under a 
random colouring. For k = (1 + e)d/ In d and for any y £ (0, 1) it holds that 

3y 2 (1 + e) e 



Pr 



< exp 



8 lnd 



(18) 



Proof: Since v is assigned some colour, the number of available colours for its children is k — 1. Using 
Lemma [4] and linearity of expectation we get that 



E[f v ] > (k-l)d~ — (l-4d/k 2 ) 

d^T 



> a + o- 



lnd 



lnd\ 3(1 + e) * 
1-8— > — -+ — ~T~d 1 + e . 
d ) 4 In d 



(19) 



Using a ball and bins argument, we can show that we can apply Chernoff bounds for f v . In particular, 
for any y € (0, 1) it holds that 

Pr[f v <(l-y)E[f v )} < ex V (-^E[f v }^<exp(-^^^-d^ [from©]. 

Clearly, from (fl9T ) we get that 

Pr[/„ < (1 - y)E[f v ]) > Pr 

Combining the two inequalities above we get (fTSl . □ 

It is direct to see that © follows from (fT8l l. where we set y = 1/2, and by noting that 

8 k > l-Pr 



,.s(i-,)!S±^* 



> 1 — exp 



81nd 
-a 1 + E 



32 lnd 

Also, © follows as a corollary from the following lemma. 

Lemma 6 Let V be the number of bad lists. For k = (1 + e)d/ In d, it holds that 

i d lnd In 2 d 
E[P V ] < r < + 



k- 1 ~ 1 + e 



d 



3</.(a:) 



> (1 + x)£[A,]] < dV 4(1+^) 
where (j){x) = (1 + x) ln(l + x) — x, /or x > 0. 



(20) 
(21) 



Proof: Clearly there are d different lists and each of them is bad independently of the others. Let pb a d 
be the probability for the list U to be bad. This means that Ly{i) = c while q £ L l Y . It holds that 



Pbad 



k-l 



1- 1 



k - 1 



< 



1 



k-l 



as 1-1 



fe-i 



< 1. By linearity of expectation we get that 



E%] < d Pbad < 



k-l 



, 1 2 



1 1 2 



lnd ln 2 d 

< 1 • 

" 1 + e d 
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Also, using Lemma|4]we get that 

If i / 4d\\ 3 

In turn, we get that 

3 In 

E\pv] > d Pbad > 4 ^ n +e y (22) 
Applying Chemoff bounds we have that for any x > 

Pr [P v > (1 + < exp (-<£(x) • £[/3„]) , 

where 0(x) = (1 + x) ln(l + x) — x. We get ([211 by substituting the expectation term on the r.h.s. above 
by using the bound (1221) . The lemma follows. □ 



The next two lemmas are sufficient to show (ITOb - 

Lemma 7 Let L x be a rescuable list and let Aj be the set of indices where we check for j -special lists. 
Conditional that Aj is non empty, for some i G Aj, let Qf. be the probability for L l Y to be j-special. Then 
for k > (1 + e)d/ In d it holds that 

p k > — d i+ £ 
9 

Proof: Since L x is rescuable it means that Lx{j) = c and q G L J X while there are colours, apart from 
c, that do not appear in Ll, Let fj be the number of colours, apart from c, that do not appear in L x , 
without conditioning that it is rescuable. Furthermore, so as to have U Y j-special, it should hold that 
Ly(i) ^ L x U {c} and either of the following two holds A) q G L Y and c ^ L' Y or B) q ^ U Y and 
c G L Y . Let the event Q = "L Y (i) £ L j x U {c}". It holds that 

Qk > 2 E[fj } fj >° ] Pr[q $ L y \c G L Y , Q]Pr[c G 4|Q]. (23) 
k — 1 

Noting that £[/_,-] = E[fj\fj > 0]Pr[fj > 0] we get that 

Elfjlfj > 0] > E[fj] > (k - l)dT& (l - jz) , (24) 
where the last inequality follows from Lemma|4] Also, by Lemma|4]we get that 

Pr[c G L Y \Q] > 1 - (l + ^J. (25) 



Working as in the proof of Lemma [4] we get that 



Pr[q i L Y \c G L Y ,Q] > -d'^ (26) 



Substituting the bounds (12511 . (1261 ) and (O in (ED we get 



3 / 4cA 2 / _jl / 4cA\ 10 



^ > M_ d —(1 + ^) 



□ 
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Lemma 8 Consider a random k-colouring of T for k = (1 = e)d/ In d. Let hi be the number of good 
lists that correspond to each rescuable list L? x . For any real c > 0, it holds that 



Pr 



e-l 

h v < - , (3 V < chid 
2c ma 



< 



exp 



e-l \ 

~8clnd 



Proof: It is direct that the number of lists that are assigned to each rescuable list depends on the actual 
number of bad lists. Let j3 v be the number of bad lists. Conditioning that f3 v < clad, for some fixed 
c > 0, it is direct that each rescuable list L 3 X , is assigned a set of at least Lnrhl ~~ non-bad lists to find 
a j-special. Using Lemma[7]we get that 



E[h v \p v <c\nd] > ^d^(^— 

9 \clnfl 



> 



e-l 

cln d 



Also, note that given the rescuable lists, each of the remaining lists are special independently of the other 
lists. Thus, we can apply Chernoff bounds and get the following inequality. 



Pr 



d 1+t 

K < (1 - y)— — - | P v < chid 
cmd 



< 



cxp 



y 2 d^+c 
2c hid 



The lemma follows by setting above y = 1/2. 



□ 



4 Proof of Proposition |U 

The matrix 5 has random entries, let N be its total number of columns. A way of constructing 5, which 
is equivalent to the one described in Remark [8j is the following one: Consider some sufficiently large 
positive integer I « N. We construct 5 in rounds. Assume that after round i — 1 we have constructed 

5 up to some column t, for some t « N. Additionally, let X t = Ylj=i <5 ( 1 , j ) — 5(2, j). Then, during 
the round i we proceed as described in the following paragraph. 

If X t = 0, then we use identical coupling for 5(1, j), 5(2, j) for all t < j < N. If X t > 0, 
then we consider X t many sets of columns of 5 whose values has not been set yet. Each of these 
Xt many sets contains at most I columns. More specifically, the first set R\ starts from column t + 1 
up to column T, the value of T will be defined in what follows. We set the values in each column 
j G R\ by coupling 5(1, j) 5(2, j) such that 5(1, j) = 1 - 5(2, j). T is either the first time that 
Y^j=t+i <S(1) i) — 5(2, j) = —1 or if this is not possible up to column t + I, then we have T = t + I. 
Continue with the second set of columns R\ H and so on. Round i ends after having finished with all 
these X t sets of columns. Then we continue in the same manner with the round i + 

For each set of columns R 1 -, {R l - is submatrix of 5), we have the following lemma which is going to 
be useful in the proof of Proposition |4] 

Lemma 9 Let I > 10, the maximum number of columns of Rj. If the entries are such i?* (l,s) 7^ 
i?* (2, s)for any column s ofRj, then it holds that 



E 



T 



1 + 2^(1,*) -H*.(2,t) 



t=i 



< 



2.3 



7T 



where T is the actual number of columns of R % -. 
5 i?2 starts from the column T + 1 
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Proof: For every t it holds that i?*-(l,t) — JR!-(2, t) is equal to —1 with probability 1/2 or it is equal 

to 1 with probability 1/2. It is direct to see that the partial sums W s = J2t=i ^K 1 ' *) ~ ^K 2 ' s )' for 
s < T constitute a symmetric random walk on the integers which starts from position zero and stops 
either when it hits —1 or after I steps, whatever happens first. We can simplify the analysis and remove 
the dependency from the random variable T, by assuming that W s continues always for I steps and the 
state —1 is absorbing. Then, the lemma follows by just computing E[Wi + 1]- In particular, we have 
that 

E[Wi + 1] = E[Wi + l\Wi + -1] • Pr[Wi + -1]. (27) 

Let T be the step that Wt hits —1 for first time. Using the Reflection Principle we show can that for any 
nonnegative integer i it holds that 

Pr[T = 2i + l]=2^ 2l+1 ^^^. (28) 

i + 1 

It is direct that the Wt cannot be —1 for t even, i.e. Pr[T = 2i] = 0, for every positive integer i. It is 
direct to see that it holds that 

(2i\ 

Pr[Wi = -1] = Pr[T < I) = 1 - Yl 2"( 2i+1 )-^ 



i + l 

i>L(J-i /2J 



To this end we use Stirling approximation, i.e. for a sufficiently large n it holds that n\ = \j2nn (-) e A " 
w ^ tri i2n+i — ^ n — ikrl ■ Then we have that 

2 -( 2i +i)_2)_ < j_ j_ < .[X 

^ i + l - 2J^ ^ j3/2 — v ttZ" 

i>L(/-i)/2j v i>L(J-l)/2j 

Pr[W t = -1] > 1 - JJ. (29) 

On the other hand, it is direct to see that given that the walk Wt does not hit — 1 it is just a random walk 
on the positive integers and it is a folklore result that 



Thus, we get that 




m\z^-i]<^-iy^ + -y ,30) 

The lemma follows by plugging (l29l) and (l30l) into (|27"T ) and taking / > 10. □ 



Proof of Proposition |4j Consider the revelation of the values of the matrix S we gave above. Let tj 
be the index of the column we have revealed up to round i. I.e. at round i + 1 we check whether 
-^tj = S/=i ^(^' j) ~ j) i s zero or not - Let Z the maximum number of columns in each submatrix 
R) be equal to 10. 

Given X ti and assuming that the coupling continuous, i.e. ti the number of columns we have re- 
vealed so far is much smaller than N, we show that it holds that 

E[X U+1 \X U ]<^X U . (31) 

However, before showing the above let us see which are its consequences. Taking the average from both 
sides, we get 

7T \ 7T J 
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since X tl = 1 (it always holds that <S(1, 1) — 5(2, 1) = 1). It is also direct to see that it always holds that 
Xt. < l-Xt. . <l l . That is, in round i we will need to reveal at most l % columns of the matrix. This fact 
implies that the maximum number of rounds j we can have satisfies the condition that Yjt=o ^ — 
Direct calculations suggest that j < = 0.43 In iV, since I = 10. Clearly, the proposition follows 

once we show (f3TT >. For this we are going to use Lemma [9] Notice that given that at round i we have 



Xi 



E*Li (5(l,j)-5(2,j)) 



, for Xi+i the following holds: 

x t - 



X ti+1 =^[l + J>*(l,j) 



s=0 



where T s is the length of the submatrix R\ . From Lemma [9] we have that for any i, s it holds 



E 



i + ^^(i,i)-^(2,i) 



< 



2.3 



7T 



Combining the above two relations and by linearity of expectation we get that 



E[Xt i+1 \X ti ] 
The proposition follows. 



s=l 



l + ^4(l,i)-^(2,i) 



7T 



□ 



5 Rest of the proofs 
5.1 Proof of Lemma [1] 

Consider some integer k and let u ci : [k] d — > [0, 1], for some c± S [k], be the distribution over the 
c?-dimensional vector induced by the following process: A vector S is distributed as in v ci if component 
S(i) is distributed uniformly at random in [&]\{ci}, independently of the other components, for eveiy 
i G [d\. 

Observe that the information we have for L 3 X and U Y is the following: For L 3 X we know that the 
colour c G L J X , q ^ l? x since Lx(j) = q, and there is at least one extra colour in [fe]\{c, q} that does 
not appear in l? x . As far as L l Y is regarded, we know that c G L l Y , q ^ L l Y and Ly{i) is equal to a 
colour that does not appear in L 3 X . 

So as to show the lemma it suffices to show that conditional the colouring of Ly(i), the distribution 
of L J X is identical to the one of L Y . Assume that Ly{i) = s, for some s G [fe]\{c, q}. 

Let the event A ="L Y is j'-good and Ly(i) = s". For any S G [k] d it holds that 

Pr[L Y = S\A] = u s (S\B), 

where B ="there exists t G [d] such that S(t) = c and there is no t G [d] such that S(t) = q". Let Q be 
the number of colours in [fe]\{c, q] that do not appear in L J X . It suffices to show that it holds that 

Pr[L j x = S\s,q$ L j x ,c£ L X ,Q > 0] = u s (S\B). (32) 
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Clearly we have that 

Pr[Q > 0, s,q L J X , c G L 3 X ] 

Pr[s,q^L x , L X = S, c G L\\ 

Pr[s,q^L x , ceL x ] 

= Pr[L j x = S\s, q i L x , c G L j x \. (33) 

In the penultimate derivation we eliminated the event Q > from both probability terms, in the nomina- 
tor and denominator, since whenever s ^ L x holds it also holds that Q > 0. Then, it is straightforward 
that the r.h.s. of (|33T > is equal to u s (S\B), i.e. (1321 holds as promised. 



5.2 Proof of Lemma H 

The lemma follows by just examining the information we have for each of the four lists. As far as the 
i-good pair (L X ,L Y ) is concerned we have the following: Lx(t) is distributed uniformly at random 
among the colours [&]\{c, q} that do not appear in L x while c ^ L x and q G L x . Also, Ly(t) is 
distributed uniformly at random among the colours [&]\{c, q} that do not appear in L l Y while q ^ L x 
and c G L x . 

As far as the i-fail pair (L S X ,L Y ) is concerned we have the following: Lx(s) is distributed uni- 
formly at random among the colours [fc]\{c, q} that do not appear in L l x while q £ L x and c G L s x . 
Additionally, Ly(s) is distributed uniformly at random among the colours [&]\{c, q} that do not appear 
in U Y while c ^ L s x and q G . 

The lemma follows by noting that we have coupled the rescuable pair L x and L Y such that the 
colours in [&]\{c, q} that do not appear in both lists are exactly the same. Thus, we can couple identically 
Lx{t) with Ly(s) and Lx{s) with Ly{t). Then, it is direct that we can couple identically L x with L Y 
and L S Y with LL. 



6 Full Coupling 
Coupling: (x(v),y(v)) 

IF X(v) = Y(v), then couple indenticaly the children of v. 
For each w, child of v execute Coupling(X(u;), Y(w)). 

ELSE do the following: 

Phase 1:- Partial revelation of the lists. 

1. Reveal only for which i we have Lx(i), Ly(i) G {c, q}. Couple the choices of Lx and Ly such 
that if Lx{i) = q, then Ly{i) = c. 

2. For each i such that Lx{i) = q and Ly(i) = c reveal whether U x and U Y is "bad" or not. We 
use coupling such that the result of revelation to be the same for both L l x and L Y . 

3. For each pair of bad lists (L x ,L l y ) reveal whether they are "rescuable". We use coupling such 
that the result of revelation to be the same for both U x and U Y . The coupling is so that the set of 

colours in [fc]\{c, q} that are not used to be the same for both rescuable lists L X ,L Y . 

4. Partition the non-bad L x s and LyS to (maximal) equally sized sets. The partitions are as many 
as the rescuable pairs and each rescuable pair is associated to exactly one partition. That is, the 
rescuable pair (L X ,L Y ) is associated to a set of indices A{ C [d] such that the following holds: 
For any t G Ai L x and L Y belong to partition associated to (L l x ,L Y ). 
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5. For each i and for each j G Ai we reveal if the pair (L J X ,L Y ) consists of i-special lists. We use 
coupling such that either both lists in the pair are i-special or both are not. 

Phase 2: - List Association. 

For each rescuable pair (L x , D Y ) do the following: 

1. Reveal each i-special pairs in A, whether it is i-good or z-fail until either of the following two 
happens: 

• the number of i-good pairs exceeds the number of i-fails by one, 

• there are no other z-special pairs in Ai to reveal. 

2. Reveal the remaining unrevealed i-special pairs, if any, by using identity coupling. 

3. Match every i-good pair with one i-fail such that no two i-good pairs are matched to the same 
i-fail pair. 

4. If the i-good pair (L X ,L J Y ) is matched with the i-fail (L X ,L Y ), then set f(j) = s and f(s) = j. 

5. If there is an i-good pair (L J X ,L Y ) for which there is no z-fail to be matched, match it with the 
rescuable pair (L x , L Y ). Also, set f(i) = j and f(j) = i. 

6. For each j £ Ai such that (L X ,L Y ) is not matched yet, match it to itself and set f(j) = j. 

Phase 3: - Full revelation. 

1. For every s, t such that f(s) = t, couple optimally Lx{s) with Ly(t) as well as L s x with L Y . 

2. Reveal which element of the list Lx is assigned to which child of r and which element of L? x 
goes to which grandchild of r, as Remark [6] specifies. 

3. Assuming that v, child of r, is such that X(v) is set Lx(s), then we set Y(v) equal to Ly(t), 
where t = f(s). Also, for u, child of v, such that X(u) set L s x (j) we set Y{u) equal to L Y (j). 
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